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Keyword: reflectivity information is rooted in the DWR image with the help of colors 
Back propagation and color bar is provided to distinguish among different reflectivity 
a pe information. Artificial Neural network predicts the color based on the 
Color classification maximum likelihood estimation problem. This paper presents a best possible 
Doppler Weather Radar backpropagation algorithm for color identification in DWR images by 
Reflectivity comparing various backpropagation algorithms such as Levenberg- 
Marquardt, Conjugate gradient, and Resilient back propagation etc.,. Pattern 
recognition using Neural networks presents better results compared to 
standard distance measures. It is observed that Levenberg-Marquardt 
backpropagation algorithm yields a regression value of 99% approximately 

and accuracy of 98%. 
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1. INTRODUCTION 

Clouds are helpful in maintaining the earth’s energy balance. Clouds are classified based on their 
structure and height in the atmosphere which influences the radiation budget in different ways. Apart from 
cloud height in the atmosphere, Cloud optical thickness plays a crucial role in the cloud classification. Cloud 
optical thickness is directly related to the reflectivity of the DWR image [1]. The higher the reflectivity value, 
the thicker the cloud. The detection of convective clouds based on the reflectivity parameter from the MAX- 
Z product of DWR images plays a key role in estimating the amount of precipitation intensity. The 
convective clouds are classified based on the reflectivity parameter by professional humans at the Indian 
Meteorological Department (IMD), which will sometimes lead to contradicting the results due to man-made 
errors. Hence, retrieval of reflectivity information from the DWR images without human intervention is 
essential in this field. Many researchers in this field work on the DWR raw and processed data rather than the 
image to exploit the relation between reflectivity and rainfall rate. The processed products available with the 
IMD are Reflectivity, Surface Rainfall Intensity (SRI), Precipitation Accumulation (PAC), Plan Position 
Indicator (PPI), Plan Position Indicator - Close Range and Volume Velocity Processing. 

Color images communicate a large amount of information rather than the grayscale image or the 
binary image [2, 3]. Color images are preferred over gray images because they convey more information with 
minimal effort [4]. The DWR images convey information about Reflectivity, Surface Rainfall Intensity, 
Precipitation Accumulation, Wind direction, and Intensity. The value of the product at a particular location 
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and time is identified with the help of color bar provided on the right hand side of the image. A large number 
of distance metrics are available in literature till now for color matching. 

Pattern recognition is a branch of machine learning, which focuses on recognizing different patterns. 
Pattern Recognition provides the solution to speech recognition, recognition of handwritten characters, face 
recognition and medical diagnosis. Artificial Neural Networks (ANN) are useful where the limits between 
different patterns are not defined precisely [5]. 

In supervised learning, input patterns and its corresponding output patterns are provided for learning 
which is used to adjust the neuron weights whereas, unsupervised learning no training samples are provided. 
In pattern recognition, outputs are classified based on input sets, bias weights and neuron weights. Supervised 
classification finds its application in the area of curve fitting, time series prediction, etc. Unsupervised 
classification finds its application in the area of clustering. The reinforcement learning is a type of machine 
learning which aims at maximizing the performance for a specific problem of context. The output of the 
network depends on the past experiences which is a trial and error approach. 

This paper focuses on color identification of DWR images for reflectivity extraction using Artificial 
Neural Networks. In Section 2, the basics of Artificial Neural Networks are discussed. In Section 3 different 
Back Propagation Neural Network methods are discussed. In section 4, Research Method to extract 
reflectivity from DWR images is discussed. In section 5 results and discussion are discussed, followed by 
conclusions and by the references. 


2. ARTIFICIAL NEURAL NETWORKS 

ANN functions in a similar way to that of the brain. The basic structure of Artificial Neural Network 
(ANN) is shown in Figure 1. The network shown in Figure 1 has m input samples, n output samples, and k 
hidden layers. The network is provided with sample inputs and it’s corresponding sample outputs to train and 
adjust the output and hidden layer neuron weights [6]. Each input signal x1, x2, ... xn is modeled by some 
weight values wl, w2, ... wn. The sum of the product of the inputs and the weights is applied to a 
thresholding function (also called as activation function), which model the output. The constant difference 
between actual value and the desired value is adjusted using a bias value. The basic ANN equations are 
shown in Equation 1 and 2. There are a wide variety of activation function such as binary threshold function 
(also referred to as Heaviside function), Fermi function (logistic function), hyperbolic tangent etc., available 
in the literature. The thresholding function will be chosen depending on the nature of the problem. 


Input Hidden Output 
layer layer layer 
4 
”2 
Xm K 
Figure 1. Artificial Neural Networks 
Net=} -1 x; * w; + bias (1) 
Output=f (Net) (2) 


ANN has proven to be an efficient alternative to traditional methods of distance measures [7, 8, 9]. 
The backpropagation learning algorithm is used for training multilayer perceptrons (MLP). The MLP consists 
of a set of an input layer for receiving inputs, one or more hidden layers for computation, and an output layer 
for presenting the output. The signal is applied at the input layer which propagates to the output through the 
hidden layer [10, 11]. Input layer communicates with the external world and presents a pattern to the 
network. This process is called as excitation. The output layer presents a pattern to the external world. The 
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number of outputs depending on the type of problem. Hidden layer acts as an intermediate layer between 
input and output. Hidden layers are not required for linear separable problems. Depends on the complexity of 
the problem, the number of hidden layers is decided. 

The neural network has to be provided with a sufficient number of input samples for training. 
Underfitting is a condition when the number of input samples is less than the minimum number of input 
samples required for training. When the number of input samples is more than the minimum number of input 
samples required for training leads to overfitting. The number of hidden layer neurons required for a problem 
is 2/3rd the sum of the number of input and output neurons [12]. In order to save computation time and 
memory, the number of hidden layers in first and second layer should be almost same. There are different 
algorithms used in this paper for training the neural network is Levenberg-Marquardt, Conjugate gradient, 
and resilient backpropagation algorithm [13]. The error of the network is defined as the difference between 
desired target and network response. The performance of the network is evaluated using different metrics 
such as mean square error (RMSE), Mean Absolute Error (MAE), Mean Absolute Deviation (MAD) as 
shown in Equation 3, 4 and 5. The main focus of this back propagation algorithm is to reduce the global error 
as minimum as possible. 


N -—d;)2 
RMSE-= i-i di) (3) 
N 


1 
MAE=— Lily- dil (4) 


MAD=—YiLalyi — Jil (5) 


3. THE DIFFERENT BACK PROPAGATION NEURAL NETWORK METHODS 

There are various Backpropagation algorithms supported in the literature. Of them, Gradient 
Descent (GD), Gradient Descent with Momentum (GDM), Variable Learning Rate with Momentum (GDX), 
Conjugate Gradient (CGP), Quasi-Newton (BFGS), Levenberg-Marquardt (LM), and Resilient back 
propagation (RB) are used to adjust the weights of the Neural network. 

In the gradient descent Back Propagation algorithm, bias weights and network weights are updated 
in the direction of negative gradient performance function [14, 15]. The parameter n is the learning rate 
parameter which has a direct influence on the training the network. Gk is the error gradient with respect to 
the weight vector. The updated weight vector is given in Equation 6 [16]. Gradient Descent suffers from the 
shallow local minimum. GDM can skip such minimum values by updating the weight values equal to the sum 
of modified weight in Gradient descent and the fraction of previous weight values as given in Equation 7. 
The parameter u is the coefficient of momentum and it varies between 0 and 1. 


Wk+1=Wk -n * Gk (6) 
Wk+1=Wk — n * Gk - p * Wk-1 (7) 


The GD and GDM method suffers from the problem of low convergence rate. The learning rate 
parameter value has a direct relation to the convergence. As the learning rate increases convergence value 
increases. The algorithm takes a long time to converge if the value of the learning rate parameter is high and 
it leads to an unstable network. To overcome this problem, variable learning rate backpropagation with 
momentum is used. In this algorithm, the value of 7 is large initially and it decreases as time progresses. The 
weight adjustment in GDX is given by equation 8. 


Wk+1=Wk - nk+1* gk + u * Wk-1 (8) 


The methods discussed till now uses the steepest descent method which works at the direction of the 
negative gradient for modifying the neuron weights. The convergence rate of these methods is very slow. In 
order to improve the convergence rate, the conjugate direction of the search is preferred over steepest descent 
method. Conjugate gradient descent back-propagation algorithm (CGD-BP) is used for training purpose. In 
CGP search is performed along the conjugate gradient direction which will minimize the cost function along 
the line by adjusting the step size. The weight update in the conjugate gradient method is given in equation 
9[17]. The direction of Conjugate gradient search is given in equation 10. Bk is the ratio of norm squared of 
the current gradient to norm squared of the previous gradient as shown in the equation in 11. 
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Wk+1=Wk +n * Pk (9) 
Where, Pk=-gk + Bk * Pk-1 (10) 
_ Agh_1* 8k 
= ek-1 bk 11 
Bx gh_1* &k-1 ( ) 


Levenberg-Marquardt algorithm aimed at speeding up the training without computing the Hessian 
matrix directly. The Hessian matrix is computed using the Jacobian matrix as shown in Equation 12. The 
principal diagonal elements of Hessian matrix are larger than zero. The weight update rule in the Levenberg- 
Marquardt algorithm is presented in Equation 13. 


H=JTJ+ul (12) 
Wk+1=Wk — (JkT Jk + u I )- 1 Jk ek (13) 


The Levenberg-Marquardt switches between steepest descent algorithm and Gauss Newton 
algorithm during the training phase. The convergence rate of the Gauss Newton method is fast and unstable, 
whereas, Levenberg-Marquardt overcomes the problem instability by maintaining the convergence rate fast. 
A Gauss Newton algorithm is used when the coefficient u is very small. The Steepest Descent method is used 
when u is very large. The relation between learning rate n and the combination coefficient u is given by the 
following relation as shown in Equation 14. 


n=l/p (14) 


The activation function typically used in a multilayer network is a sigmoid transfer function. The 
primary role of activation function is to compress the infinite input range to a finite output range. For higher 
values of inputs, the slope of the activation function approximates zero. This creates a problem while training 
multilayer networks since the gradients have a small magnitude. To eliminate these effects, resilient 
backpropagation training algorithm is used. Sign of the derivative plays a crucial role in the weight update 
rather than the magnitude of the derivative. If the derivative of the performance function for two successive 
iterations has the same sign, then the weight update value and bias values will be increased otherwise it has 
the decreasing pattern. If the derivative is zero, then there is no need to update the weights. 


4. RESEARCH METHOD 

The present work focuses on extracting the reflectivity values from the DWR image. The similar 
approach is used to extract the SRI and PAC from the DWR images. The images obtained from DWR MAX 
(Z) product of IMD Chennai contains the Reflectivity information about convective clouds with Chennai as a 
center and spans around 250Km circular area. A sample DWR MAX (Z) image captured on 19th July 2017 is 
shown in Figure 2 [18]. The reflectivity information on a particular location on the DWR image is indicated 
using 17 different colors. The color and its corresponding reflectivity are provided on the right side of the 
image [19, 20]. The reflectivity value greater than 60 dbZ indicates strong precipitation and hail. The DWR 
image contains the reflectivity values from 20 dBZ to 65 dBZ. 

The horizontal, vertical resolution of the DWR image is 1 Km/Pixel, 0.089 Km/pixel respectively. 
The DWR image covers a circular geographical area of 250 Km radius with Chennai as the center. The color 
information provided on the top of the image provides the maximum value of reflectivity seen along the line 
from north to south. The color information provided on the right of the image provides the maximum value 
of reflectivity seen along the line from east to west. The range of convective cloud spans from 0.1 Km to 18 
Kms. The DWR image is processed to extract the convective information by eliminating the static 
background image. The steps involved in preprocessing the DWR image are noise filtering, morphological 
operations such as closing and opening of an image, thresholding and image subtraction. The colors of the 
convective extracted image are compared against the color bar provided on the image to estimate the 
reflectivity values. 

DWR image contains seventeen different reflectivity values, and each value represents the different 
amount of Reflectivity information. The Reflectivity value in DWR ranges from 20 dBZ to 65 dBZ in steps 
of 2.5 dBZ. The estimated reflectivity value from the DWR image using ANN has the continuous values 20 
dBZ to 65 dBZ. The value is rounded off to the nearest value provided on the color bar chart. The 
Reflectivity value for each color combination is tabulated as shown in Table 1. 
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Figure 2. DWR Reflectivity image from IMD Chennai 


Table 1. Color composite of Reflectivity values 
Reflectivity (in dBZ) Color values (RGB) 


20.0 [000.6] 
22.5 [000.8] 
25.0 [00.2 1] 
27.9 [00.4 1] 
30.0 [0 0.6 1] 
32.5 [0.2 0.8 1] 
35.0 [0.4 1 1] 
37.5 [111] 
40.0 [110.8] 
42.5 [110] 
45.0 [10.8 0] 
47.5 [10.40] 
50.0 [10.2 0] 
52.5 [0.8 0 0] 
55.0 [0.6 0 0] 
60.0 [0.6 0 0.20] 


5. RESULTS AND DISCUSSIONS 

The reflectivity value of an image is represented by a unique combination of color values in the 
RGB model. Test patterns were generated with 10% deviation in the color composite. The test patterns are 
used to train, validate and test the network [21]. The networks have three inputs for which Red, Green, and 
Blue color values at a certain location are given as inputs. It has one output which is used to classify the 17 
different reflectivity values [22]. In order to save time, memory and complexity of the network, the number 
of hidden layers used in this model is three. The block diagram of the neural network model is shown in 
Figure 3. 

The neural network is trained with different types of back propagation algorithms such as variable 
learning rate backpropagation, Levenberg-Marquardt, One step secant, scaled conjugate gradient, and 
resilient back propagation [23]. The network is provided with 5100 different samples for training and testing. 
Out of which 80% of the samples is used for training the network, 10% of the samples are used for testing the 
network and the remaining 10% of the samples are used for validating the network. The results are compared 
over repeated iterations by shuffling the training sample values. The error histogram is a plot between error 
value and the number of instances the error has occurred. The error histogram of 20 bins is plotted as shown 
in Figure 4. The center of the histogram has the minimum error and the error increases as we move away 
from the center. The second plot which is trained with Levenberg-Marquardt has minimum error compared to 
the other algorithms. The plot shows that 99% of the samples fall within the range of +1which falls within the 
tolerable error limit of +1.25. The reflectivity values will be in steps of 2.5 dBZ. Any value falls within the 
range of 2.5 dBZ will not be considered as an error. For example, the reflectivity range of 25dBZ is from 
23.75 dBZ to 26.25 dBZ. 
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Hidden Output 


3 1 


Figure 3. Block diagram of Neuron network model 


The dashed line the regression plots indicates the outputs. The best possible fit between network 
outputs and desired targets is indicated by a solid dash line. The relation between outputs and targets is 
indicated by the regression value. The regression plot gives information about how close the output of your 
model is to the actual target values. The network outputs have a strong linear relation to desired targets if the 
value of Regression coefficient approaches unity. If the value of regression coefficient approaches zero, the 
relation between output and targets cannot be predicted. The regression plots for GDX, LM, OSS and SCG 
model is shown in Figure 5. The regression values for the GDX, LM, OSS, and SCG are 0.98697, 0.99812, 
0.99346 and 0.98499 respectively. The LM backpropagation algorithm shows better performance compared 
with the other three models. 

The performance plot is a plot between Mean Square Error (MSE) and the number of epochs. MSE 
is the average squared difference between outputs and targets. MSE of Zero implies no error. As the training 
process progresses, the MSE value reduces. When the MSE value is reduced to a minimum value, the 
training stops and the network are validated with the samples. In the validation phase, if the network behaves 
properly, then the training stops and it is ready for testing. The MSE values for GDX, LM, OSS, and SCG are 
3.8841, 0.4964, 1.8942 and 4.4903 respectively. The LM shows better performance compared to other 
methods based on MSE. 

In the previous work, Refelctivity extraction over a location is performed using different distance 
measures such as Euclidean, Standard Euclidean, City block, Minkowski, Chebychev, Mahalanobis, cosine 
and correlation [24, 25]. Color classification accuracy, using conventional distance measures is 95 % for a 
standard Euclidean distance over different color spaces such as RGB, HSV, YCbCr and La*b*. The accuracy 
percentage for different standard distance measures and different neural network algorithms are tabulated in 
Table 2. The accuracy percentage values tabulated in Table 2 are the average values of repeated iteration. 
Reflectivity extraction using neural networks provides better results compared with the previous work. 
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Figure 4. Error histogram plots (a) GDX (b) LM (c) OSS (d) SCG 
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Figure 4. Error histogram plots (a) GDX (b) LM (c) OSS (d) SCG 
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Figure 5. Regression plots (a) GDX (b) LM (c) OSS (d) SCG 
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Figure 6. Performance Curves (a) GDX (b) LM (c) OSS (d) SCG 


Table 2. Accuracy Percentage in RGB Space for Different Distance Metrics 


Method % of Accuracy 
Euclidean 6.20 
Seuclidean 6.09 
Cityblock 6.29 
Distance Measures ie lee 
Mahalanobis 13.22 
Cosine 25.73 
Correlation 41.53 
GDX 13.18 
LM 1.82 
Neural Network Methods Oss 482 
SCG 7.22 


6. CONCLUSION 

In this paper, extraction of the Reflectivity parameter from the DWR image is done with the help of 
artificial neural networks. The network is provided with 5100 sample inputs to classify the 17 different target 
outputs. The network is trained with different types of backpropagation algorithms such as LM, GDX, OSS, 
and SCG. The network trained with LM gives better performance and regression values. Even though the 
regression value changes for every iteration, LM proves to be the best compared to the other methods. LM 
presents better accuracy compared to the traditional standard distant measures. 
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